Weighted real-time heuristic search
نویسندگان
چکیده
Multiplying the heuristic function by a weight greater than one is a well-known technique in Heuristic Search. When applied to A* with an admissible heuristic it yields substantial runtime savings, at the expense of sacrificing solution optimality. Only a few works have studied the applicability of this technique to Real-Time Heuristic Search (RTHS), a search approach that builds upon Heuristic Search. In this paper we present two novel approaches to using weights in RTHS. The first one is a variant of a previous approach by Shimbo and Ishida. It incorporates weights to the lookahead search phase of the RTHS algorithm. The second one incorporates the weight to the edges of the search graph during the learning phase. Both techniques are applicable to a wide class of RTHS algorithms. Here we implement them within LSS-LRTA* and LRTA*-LS, obtaining a family of new algorithms. We evaluate them in path-planning benchmarks and show the second technique yields improvements of up to one order-of-magnitude both in solution cost and total search time. The first technique, on the other hand, yields poor results. Furthermore, we prove that RTHS algorithms that can appropriately use our second technique terminate finding a solution if one exists.
منابع مشابه
Weighted Lateral Learning in Real-Time Heuristic Search
Real-time heuristic search models an autonomous agent solving a search task. The agent operates in a real-time setting by interleaving local planning, learning and move execution. In this paper we propose a simple parametric algorithm that combines weighting with learning from multiple neighbors. Doing so breaks heuristic admissibility but allows the agent to escape heuristic depressions more q...
متن کاملTime-Bounded Best-First Search for Reversible and Non-reversible Search Graphs
Time-Bounded A* is a real-time, single-agent, deterministic search algorithm that expands states of a graph in the same order as A* does, but that unlike A* interleaves search and action execution. Known to outperform state-of-the-art real-time search algorithms based on Korf’s Learning Real-Time A* (LRTA*) in some benchmarks, it has not been studied in detail and is sometimes not considered as...
متن کاملA novel heuristic algorithm for capacitated vehicle routing problem
The vehicle routing problem with the capacity constraints was considered in this paper. It is quite difficult to achieve an optimal solution with traditional optimization methods by reason of the high computational complexity for large-scale problems. Consequently, new heuristic or metaheuristic approaches have been developed to solve this problem. In this paper, we constructed a new heuristic ...
متن کاملLearning for Adaptive Real-time Search
Abstract. Real-time heuristic search is a popular model of acting and learning in intelligent autonomous agents. Learning real-time search agents improve their performance over time by acquiring and refining a value function guiding the application of their actions. As computing the perfect value function is typically intractable, a heuristic approximation is acquired instead. Most studies of l...
متن کاملحل مسئله زمان بندی ماشینهای موازی نامرتبط با اهداف کل زودکرد وزنی و کل دیرکرد وزنی با استفاده از الگوریتم جستجوی پراکنده چند هدفه
The parallel machine scheduling problem is an important and difficult problem to be considered in the real-world situations. Traditionally, this problem consists of the scheduling of a set of independent jobs on parallel machines with the aim of minimizing the maximum job completion. In today's manufacturing systems, in which both early and tardy finishing of job processing are undesired, the o...
متن کاملModeling the Time Windows Vehicle Routing Problem in Cross-Docking Strategy Using Two Meta-Heuristic Algorithms
In cross docking strategy, arrived products are immediately classified, sorted and organized with respect to their destination. Among all the problems related to this strategy, the vehicle routing problem (VRP) is very important and of special attention in modern technology. This paper addresses the particular type of VRP, called VRPCDTW, considering a time limitation for each customer/retai...
متن کامل